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Abstract 

Objective: To evaluate the validity of cancer-specific and generic preference-based instruments to discriminate 
across different measures of cancer severities. 

Methods: Patients with breast (n = 66), colorectal (n = 57), and lung (n = 61) cancer completed the EORTC QLQ- 
C30 and the FACT-G, as well as three generic instruments: the EQ-5D, the SF-6D, and the HUI2/3. Disease severity 
was quantified using cancer stage, Eastern Cooperative Oncology Group Performance Status (ECOG-PS) score, and 
self-reported health status. Comparative analyses confirmed the multi-dimensional conceptualization of the 
instruments in terms of construct and convergent validity. 

Results: In general, the instruments were able to discriminate across severity measures. The instruments 
demonstrated moderate to strong correlation with each other (r = 0.37-0.73). Not all of the measures could 
discriminate between different groups of disease severity: the EQ-5D and SF-6D were less discriminative than the 
HUI2/3 and the cancer-specific instruments. 

Conclusion: The cancer-specific and generic preference-based instruments demonstrated to be valid in 
discriminating across levels of ECOG-PS scores and self-reported health states. However, the usefulness of the 
generic instruments may be limited if they are not able to detect small changes in health status within cancer 
patients. This raises concerns regarding the appropriateness of these instruments when comparing different cancer 
treatments within an economic evaluation framework. 

Keywords: Quality of life, cancer-specific instruments, generic instruments, external validity: responsiveness, disease 
severity, utilities 



Introduction 

Cancer is the leading cause of death in many developed 
countries. In Canada, the latest statistics confirm that can- 
cer-related mortality is now higher than mortality from 
circulatory diseases [1]. As such, the demand for effective 
and efficacious treatments is rising. New cancer therapies 
are being developed, and approved, with the aim of 
improving a patient's prognosis [1,2]. These treatments, 
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however, often have a detrimental effect on the patient's 
quality of life (QOL). As the therapies are administered in 
accordance to the patient's severity level, it is important to 
have a valid QOL instrument which can discriminate 
across all levels of disease severity. This is of importance 
to oncologists as defining prognostic determinants may 
aid in the stratification of randomization on known prog- 
nostic factors in clinical trials and in therapeutic decision- 
making in routine practice while maintaining a high level 
of QOL for the patient. 

QOL can be evaluated using either disease-specific or 
generic preference-based instruments. Disease-specific 
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instruments have the capacity to detect minimal changes 
in a specified health condition [3]. In oncology, the two 
most widely used instruments to assess QOL are the 
European Organization of Research and Treatment in 
Cancer (EORTC) Quality of Life Core 30 (QLQ-C30) 
and the Functional Assessment of Cancer Therapy - 
General (FACT-G) [4,5]. While the advantage of using 
cancer-specific instruments is their capacity to detect 
minimal changes in a disease, these instruments are not 
suitable for comparisons across different disease states. 
As a result, the use of generic preference-based instru- 
ments is a better option. The advantage of generic 
instruments is that they integrate different aspects of a 
health state into a single index anchored by a value of 
one for perfect health and zero for dead. This value can 
be combined with the length of time in that health state 
to generate a quality-adjusted life year (QALY), a metric 
used in economic evaluations [6]. The most commonly 
used generic preference-based instruments are the Euro- 
Qol 5D (EQ-5D), the Short Form 6D (SF-6D), and the 
Health Utilities Index (HUI) [7-9]. 

The use of generic preference-based instruments can 
be incorporated into a general health policy model to 
compare the efficiency of different programs or treatment 
strategies [6]. This provides a framework for decisions 
concerning the adoption of new treatments within a pub- 
licly-funded health care system. However, generic instru- 
ments typically cover dimensions of health such as 
mobility, pain, activity limitation, and anxiety or depres- 
sion; these dimensions may not be sensitive, or relevant, 
to treatment effects for the health condition under inves- 
tigation [3], This may be due, in part, as to why many 
cancer trials do not include generic preference-based 
instruments; instead, focusing on cancer-specific instru- 
ments to evaluate outcomes of patients. 

Currently, the validity of QOL instruments to discrimi- 
nate between different levels of cancer severity has not 
yet been adequately evaluated. Therefore, the objective of 
this study is to evaluate the validity of cancer-specific and 
generic preference-based instruments in terms of their 
ability to distinguish between different measures of can- 
cer severity. Disease severity was measured in three ways: 
cancer stage; Eastern Cooperative Oncology Group 
Performance Status (ECOG-PS) score; and patient- 
reported general health status. 

Methods 

Study Participants 

To participate in the study, patients had the following 
criteria: be diagnosed with either breast, colorectal, or 
lung cancer; be 18 years and older; be able to speak and 
read English; have a life expectancy of at least six months; 
be without cognitive impairments; and have plans to 
return to an appointment with a medical oncologist. 



Breast, colorectal, and lung cancer were chosen as they 
are among the most common cancers diagnosed in Brit- 
ish Columbia and Canada [1,2]. Recruitment and 
informed consent were undertaken by a medical oncolo- 
gist. Consented patients were given the questionnaires to 
complete at a subsequent outpatient visit at the Vancou- 
ver Cancer Clinic. 

To complete the study, patients had two options avail- 
able to them. The instruments could be completed face- 
to-face with a trained research assistant at the patient's 
appointment. Alternatively, the patients could take the 
instruments home and post the completed forms in a pro- 
vided pre-paid envelope. For both options, researchers 
were available to answer questions if needed. The order of 
the QOL instruments was randomized for each partici- 
pant. The study protocol was approved by the Research 
Ethics Board of the British Columbia Cancer Agency. 

The study was piloted with 66 cancer patients at the 
Vancouver Cancer Clinic. The objectives of this pilot 
study were, not only to determine the practicality of col- 
lecting five QOL measures in terms of administration and 
respondent burden, but also to estimate the median, mean 
and standard deviation (SD) of the different QOL mea- 
sures; the latter provided the estimates to calculate a sam- 
ple size for the main study. 

Based on results from the pilot study and other prefer- 
ence-based instruments, a difference of 0.05 in mean uti- 
lity measures of health states is considered important and 
meaningful [10,11]. Using 80% power to detect a differ- 
ence in mean health state of 0.05 between different sever- 
ity groups and assuming that the common SD is 0.10 
using an independent t-test at the 5% significant level indi- 
cates that a minimum sample size of 32 in each group is 
needed. The mean (SD) for the EQ-5D, SF-6D, HUI-2, 
and HUI-3 were 0.81 (0.17), 0.71 (0.11), 0.83 (0.13), and 
0.76 (0.23), respectively. To compare differences in mean 
health scores, we will need a sample of 62, 38, 53, and 167 
respondents for EQ-5D, SF-6D, HUI-2, and HUI-3 respec- 
tively. We therefore collected data from 182 patients. 

Data Collection 

Socio-Demographic and Clinical Information 

Data pertaining to the patients' socio-demographic infor- 
mation were obtained using a self-administered question- 
naire. The patient's cancer status, in terms of disease 
stage and ECOG-PS score, was extracted from their med- 
ical records. The stage of cancer is typically classified 
from stage 1 to stage 4; the higher the stage, the more 
aggressive and fast growing the cancer. The ECOG-PS is 
a single item rating of the degree to which patients are 
able to participate in typical activities without a need for 
rest. The scale, ranging from zero (fully active) to five 
(dead), assesses disease progression and its impact on the 
patient's daily living abilities [12]. The ECOG-PS was 
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chosen because it is a powerful predictor of QOL and an 
important concern in cancer care [13]. In addition to the 
clinical measures, the patient provided a self-reported 
health status; this was measured on a seven-point 
response scale, ranging from excellent to fair to extre- 
mely poor. 

Cancer-Specific Instruments 

For this study, two cancer-specific instruments were 
used: the EORTC QLQ-C30 and the FACT-G [4,5]. The 
QLQ-C30 dominates cancer clinical trials in Canada and 
Europe, while the FACT-G is more widely used in the 
USA [14]. The QLQ-C30 and the FACT-G contain dif- 
ferent items even though they cover the same scales or 
dimensions, respectively. 

The QLQ-C30 is a 30-item questionnaire composed of 
multi-item scales and single items to reflect the multidi- 
mensional nature of QOL in cancer [4]. It incorporates 
five functional scales (i.e., physical, role, cognitive, emo- 
tional, and social), three symptom scales (i.e., fatigue, pain, 
and nausea and vomiting), and seven single items (i.e., dys- 
pnea, appetite loss, sleep disturbance, constipation, and 
diarrhea); these are measured on a four-point response 
scale. The instrument also contains an item assessing the 
perceived financial impact of the disease and treatment 
and two seven-point response scales pertaining to global 
health and QOL. While the QLQ-C30 does not yield an 
overall score, a global health status score was created from 
the patients' responses to the two response scales relating 
to global health and QOL [15,16]. 

The fourth version of the FACT-G consists of 27 items 
covering four dimensions of well-being: physical, social/ 
family, emotional, and functional [5] . Items within these 
dimensions are evaluated on a five-point response scale. 
Using the instrument developer's algorithms, an overall 
score and four dimension scores can be generated with 
higher scores reflecting better QOL. Reliability and valid- 
ity, including responsiveness, of the instrument have been 
well documented in cancer trials and clinical settings 
[17-19]. 

Generic Instruments 

The EQ-5D questionnaire consists of a general health 
descriptive system based on five items and a 100-point 
visual analogue scale. The five items cover mobility, self 
care, usual activities, pain/discomfort, and anxiety/depres- 
sion with three levels per item (i.e., no problem, some pro- 
blems, and extreme problems). The instrument describes 
243 possible health states, which are assigned utilities 
based on country-specific algorithms developed by the 
EuroQol group. The most widely used utility algorithm 
was based on a time trade-off (TTO) survey of 2997 UK 
respondents [9]. Recently, Shaw et al [20] developed a uti- 
lity algorithm based on TTO responses from 4048 US resi- 
dents. In the absence of a Canadian algorithm, this was 
used to calculate EQ-5D utilities for this study. 



The SF-6D was constructed from a sample of 11 items 
selected from the Short Form 36 (SF-36). These items 
were valued by a representative sample of the UK general 
population using the standard gamble (SG) [21,22]. This 
is a six-dimensional health state classification system 
with each dimension having four to six levels; therefore, 
18,000 health states are described. In place of a Canadian 
utility algorithm, the UK population tariff was used. 

A version of the HUI instrument that combines features 
of the HUI mark 2 (HUI-2) and HUI mark 3 (HUI-3) was 
used in this study. The HUI2/3 contains 15 items that 
focuses on aspects of vision, hearing, speech, emotion, 
pain, mobility, dexterity, cognition, and self-care; each 
item was defined by four to six levels. Using the responses 
on the HUI2/3, two different utilities were estimated using 
an algorithm developed from random samples of the 
Canadian population: one for the HUI-2 and one for the 
HUI-3 [7]. 

Data Analysis 

Descriptive statistics were used to characterize the sample 
in terms of age, sex, marital status, ethnicity, employment 
status, education level, and annual income. In addition to 
these socio-demographic variables, the patients were char- 
acterized by disease severity. Continuous variables are pre- 
sented as means and SDs while categorical variables are 
presented as the proportion of the sample within each 
group. The QOL scores of the investigated instruments 
are reported as Tukey's values. 

Before testing the ability of the instruments to discrimi- 
nate across disease-severity measures, the psychometric 
properties of the cancer-specific instruments in terms of 
internal consistency and construct validity were examined. 
Internal consistency was evaluated using Cronbach's alpha 
coefficient [4,5] and convergent validity using correlation 
coefficients [23] . Construct validity assesses whether scales 
from different instruments, measuring similar dimensions 
of QOL, are strongly correlated with each other. Both 
parametric and non-parametric (Pearson and Spearman) 
correlation coefficients were calculated; however, as the 
results were statistically similar, results from the Pearson's 
correlation coefficients are reported. A coefficient of 
greater than 0.5 or less than -0.5 indicates a strong corre- 
lation between instruments, 0.30 to 0.49 or -0.49 to -0.30 a 
moderate correlation, and values between 0.30 to -0.30 a 
weak correlation [24]. We also compared the correlation 
between the general health scores from the cancer-specific 
instruments and the utility indices from the preference- 
based instruments. A Bonferroni correction was applied to 
counteract the problem of multiple comparisons [25]. 

The external validity of each instrument was assessed 
based on its ability to discriminate between different can- 
cer severity as represented by cancer stage, ECOG-PS 
score, and self-reported health status. This was determined 
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using the instruments' global scores. Patients with the 
greatest disease severity (i.e., cancer stage 4, ECOG-PS 
score 3, and very poor self-reported health) were hypothe- 
sized to have lower QOL scores across all instruments. 
One-way analysis of variance (ANOVA) evaluated the dif- 
ferences among QOL scores when stratified by the afore- 
mentioned variables of disease severity. 

The effect size, the standardized mean difference 
between two groups on a measured outcome, was also cal- 
culated. Each of the disease severity variables were sub- 
divided into two meaningful groups of sufficient size: can- 
cer stages 1-2 versus cancer stages 3-4; ECOG-PS 0 versus 
ECOG-PS 1-3; self-reported health status excellent-good 
versus self-reported health status fair-very poor. Stage 3 
and stage 4 were grouped together because the aim was to 
compare late stage disease with those patients in stages 1 
and 2 (early disease stages). The reason why the ECOG-PS 
1-3 categories are collapsed together is due to the small 
number of patients in PS 3 (n = 5). The decision was 
made to differentiate patients who reported "no problem" 
with their daily lives (PS 0) and those who reported some 
level of problems (PS 1-3). While it might appear slightly 
counterintuitive to combine the fair self-rating with poor 
and very poor, the decision was based purely on the num- 
ber of patients belonging in the two groups: excellent- 
good (n = 116) and fair-very poor (n = 63); including the 
'fair' respondents with the 'excellent-good' would result in 
only 27 patients in the 'poor-very poor' group. 

An effect size of one indicates a clinically meaningful 
change in magnitude equivalent to one standard deviation 
(SD). The absolute value of effect sizes (d) can be categor- 
ized as small [d = 0.2-0.5), medium (d = 0.5-0.8), or large 
(d > 0.8) [26]. By comparing the effect sizes across the dif- 
ferent cancer-specific and generic preference-based instru- 
ments, their discriminative abilities can be assessed 
[26,27]. All analyses were performed using the ST ATA 
statistical software package, version 11.1 [28]. 

Results 

Patient Characteristics 

One hundred and ninety five patients were approached 
to participate in the study. All gave consent to partici- 
pate in the study. The questionnaires were completed 
by 184 patients; a high response rate of 94% was 
achieved. The average (SD) time to complete the study 
was 22.3 (8.9) minutes. Most patients required no assis- 
tance in completing the instruments. 

The socio-demographic and clinical characteristics of 
the patients are described in Table 1. The majority of 
patients were females (65%) and the mean (SD) age was 
58.5 (11.5) years. In total, the patient sample consisted of 
66 (36%) with breast cancer, 57 (31%) with colorectal can- 
cer, and 61 (33%) with lung cancer. Although half of the 
patients were reported to be in cancer stage 4, 64 (36%) 



had an oncologist-reported ECOG-PS score of 0 (i.e., fully 
active, able to carry on all pre-disease performance with- 
out restriction); no ECOG-PS score worse than 3 was 
reported. Most of the patients reported to being in very 
good (26%) and good (29%) health states. As only five 
patients had an oncologist-reported ECOG-PS score of 3, 
these individuals were combined with the adjacent group 
to form the ECOG-PS 2-3 group. 

Quality of Life Scores 

Table 2 displays a summary of the QOL scores obtained 
from the instruments used in this study. For the generic 
preference-based instruments, a maximum score of 1.0 
was achieved but the minimum values varied. The SF-6D 
and HUI-3 had interquartile ranges (IQRs) of 0.14 and 
0.17, respectively, which is lower than those of the EQ- 
5D (IQR = 0.22) and the HUI-2 (IQR = 0.31). The mean 
(SD) values between the two cancer-specific instruments 
differed; such that patients valued their QOL higher 
using the FACT-G (81.61 (14.14)) when compared to the 
QLQ-C30 (68.90 (20.36)). Seventeen (9%) patients had a 
best possible score for the global health status score of 
the QLQ-C30; none provided the best possible score for 
the FACT-G. Fourteen of these participants gave a score 
of greater than 0.95 for the HUI-2 and HUI-3; 11 gave 
the best possible scores for the EQ-5D, and the SF-6D. 

Paired t-tests indicated no significant differences in 
mean scores of the generic preference-based instru- 
ments between females and males; married and not 
married; and Caucasian and non-Caucasian (results not 
presented). Mean EQ-5D and HUI-2 scores were found 
to be higher for more educated participants. We found 
no significant differences in mean values of the cancer- 
specific QOL scores when stratified by sex and age. 

Internal Consistency and Convergent Validity 

Cronbach's a coefficients for the QLQ-C30 and FACT- 
G scales are shown in Table 3. Both instruments met 
the minimum standard for reliability (a = 0.70). In gen- 
eral, correlations between the QLQ-C30 and the FACT- 
G were high when scales and sub-scales were related to 
the same QOL domain and low when they related to 
different domains (Table 4). A high correlation was 
observed between FACT-G physical well-being and the 
role function (r = 0.64) and physical function scale (r = 
0.55) of QLQ-C30. The functional well-being of the 
FACT-G was highly correlated with the role functioning 
(r = 0.61) and the physical functioning (r = 0.58) of the 
QLQ-C30. The social domains of QLQ-C30 and FACT- 
G were poorly correlated (r = 0.13), but the emotional 
subscales were strongly correlated (r = 0.76). The 
FACT-G global score was highly correlated with all 
QLQ-C30 domains, with the exception of cognitive 
functioning. 
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Table 1 Socio-demographic and clinical characteristics of the patients 



Frequency (%) or Mean (± SD) 



All Cancers (n = 184) Breast Cancer (n = 66) Colorectal Cancer (n = 57) Lung Cancer (n = 61) 



Female 119(65) 

Age 58.5 (±11.5) 

Marital status 

Single 24(13) 

Married/living with a partner 120 (65) 

Divorced/separated/widowed 37 (20) 

Ethnicity 

Caucasian 85 (46) 

Asian 26 (14) 

Other ethnicity 70 (38) 

Employment 

Full time 55 (30) 

Retired 66 (36) 

Unemployed 42 (23) 

Education 

Primary school 51 (28) 

Secondary school 15(8) 

College/University 9 (5) 

Other 9 (5) 

Annual income (CAD) 

<$29,999 42 (24) 

$30,000-$59,999 55 (32) 

$60,000-$99,999 35 (20) 

> $100,000 42 (24) 

Stage of disease 

1 15(8) 

2 31 (17) 

3 46 (25) 

4 92 (50) 
Eastern Cooperative Oncology Group 

0 64 (34.8) 

1 96 (52.2) 

2 1 5 (8.2) 

3 5 (2.7) 
Self-reported general health 

Excellent 14 (7.6) 

Very good 48(26.1) 

Good 54 (29.3) 

Fair 36(19.6) 

Poor 22(12.0) 

Very poor 5 (2.7) 



66 (100) 
53.2 fct 10.9) 

11 (17) 
42 (64) 

1 3 (20) 

32 (48) 

11 (17) 
22 (33) 

26 (39) 
15 (23) 
17 (26) 

14 (21) 

3 (5) 
45 (68) 

4 (6) 

12 (19) 

17 (27) 

15 (24) 

1 8 (29) 

5 (8) 

26 (66) 
8 (12) 

27 (41) 

33 (50.0) 
27 (40.9) 

1 (1.5) 

2 (3.0) 

1 (1.5) 

20 (30.3) 

21 (31.8) 
11 (16.7) 
8 (12.1) 

3 (4.5) 



25 (44) 
60.1 fct 11.1) 

6 (10) 
41 (72) 
10 (18) 

29 (51) 

7 (11) 

1 8 (32) 

20 (35) 
32 (52) 

13 (23) 

20 (35) 
7 (12) 
29 (51) 

1 (2) 

14 (25) 

15 (27) 
9 (16) 

16 (29) 

2 (3) 
2 (3) 

1 8 (32) 
27 (48) 

18 (31.6) 
33 (57.9) 

4 (7.0) 

1 (1.8) 

9 (15.8) 
16 (28.1) 
20 (35.1) 
7 (12.3) 

4 (7.0) 

1 (1.8) 



29 (48) 
63.0 fct 9.8) 

8 (13) 
37 (61) 
14 (23) 

24 (61) 
10 (17) 
28 (46) 

10 (16) 
20 (35) 
18 (11) 

1 7 (29) 
7 (12) 

31 (53) 
4 (7) 

1 6 (29) 
24 (54) 
10 (18) 

7 (13) 

8 (14) 
3 (5) 

20 (34) 
35 (59) 

13 (21.3) 
36 (59.0) 
10 (16.4) 
2 (3.3) 

4 (6.6) 

12 (19.7) 

13 (21.3) 
18 (31.0) 
10 (16.4) 

1 (1-6) 



The correlation between the cancer-specific and the 
generic preference-based instruments was positive and, 
in general, moderate (Table 5); stronger correlations 
were observed between the FACT-G and the HUI-2 (r = 
0.64) and HUI-3 (r = 0.61). The QOL scores from the 
three generic instruments moderately to strongly corre- 
lated with each other (r = 0.38-0.70). 



Discriminant Validity and Effect Size 

Table 6 illustrates the relationships between the QOL 
scores and all investigated measures of cancer severity. 
In general, the relationships between QOL and disease 
severity demonstrated a monotonic gradient, such that a 
lower QOL was associated with greater disease severity 
(i.e., higher cancer stage and ECOG-PS score, and 



Teckle et al. Health and Quality of Life Outcomes 201 1 , 9:1 06 
http://www.hqlo.eom/content/9/1 /1 06 



Page 6 of 10 



Table 2 Quality of life scores of the instruments 



Instrument 


Mean 


SD* 


Median 


IQR* 


Min. 


Max. 


QLQ-C30 


68.90 


20.36 


66.67 


25.00 


0.00 


100.00 


FACT-G 


81.61 


14.14 


83.92 


18.83 


40.00 


107.00 


EQ-5D 


0.83 


0.14 


0.83 


0.22 


0.11 


1.00 


SF-6D 


0.73 


0.11 


0.74 


0.14 


0.44 


1.00 


HUI-2 


0.76 


0.23 


0.84 


0.3 1 


-0.04 


1.00 


HUI-3 


0.83 


0.13 


0.88 


0.17 


0.30 


1.00 



* SD: standard deviation; IQR: interquartile range. 



poorer self-reported health status). This expressed the 
ability of the instruments to discriminate between differ- 
ent levels of cancer severity, thereby supporting validity 
for all instruments for this specific population. The 
results revealed that there is an absence of a linear gra- 
dient with the generic preference-based measures when 
stratified by the patient's cancer stage; this was sup- 
ported by the ANOVA results. 

Table 7 shows the effects of the cancer severity variables 
used in this study. Effect sizes calculated from the two can- 
cer-specific instruments exceeded Cohen's low limits of 
0.2. The QLQ-C30 (d = 0.40) and the FACT-G (d = 0.49) 
were generally better able to discriminate among the 
patients with early and late stage disease as indicated by 
the larger effect sizes. However, amongst the generic pre- 
ference-based instruments, the HUI-2 (d = 0.36) and the 
HUI-3 (d = 0.24) performed better than the EQ-5D (d = 
0.06) and the SF-6 (d = 0.10). Similar trends were observed 
for the ECOG-PS score and patient self-reported health 
status. 

Discussion 

The key finding of this study is that the global scores of 
the QLQ-C30 and the FACT-G and the mean utility 

Table 3 Internal consistency and ceiling-floor effects for 
the EORTC QLQ-C30 and FACT-G 



Scores mean (SD) a 1 a 2 



EORTC-QLQ-C30 








Physical functioning 


77.53(19.49) 


0.78 


0.77 


Social functioning 


72.12(26.23) 


0.77 


0.77 


Emotional functioning 


78.43(21.12) 


0.81 


0.81 


Cognitive functioning 


80.56(21.90) 


0.82 


0.81 


Role functioning 


72.83(26.35) 


0.76 


0.76 


Global health status 


68.75(20.46) 


0.78 


0.78 


FACT-G subscale 








Physical Weil-Being 


21.38(5.11) 


0.79 


0.82 


Social/Family Weil-Being 


23.13(4.09) 


0.82 


0.69 


Emotional Weil-Being 


18.38(4.36) 


0.87 


0.74 


Functional Weil-Being 


18.65(5.54) 


0.83 


0.80 


Global health status 


81.50(14.22) 


0.71 


0.89 



Notes: V = Cronbach's alpha coefficient. 

2 a = Cronbach's alpha coefficient original version of QLQ-C30 by Aaronson 
and colleagues [4] and of FACT-G by Cella and colleagues [5]. 



Table 4 Pearson Correlations between the QLQ-C30 and 
FACT-G sub-scales 





QLQ-C30 


PF 


RF 


EF 


CF 


SF 


FACT-G 


0.629 


0.522 


0.598 


0.658 


0.394 


0.542 


PWB 


0.553 


0.551 


0.637 


0.504 


0.443 


0.545 


SWB 


0.1 93 + 


0.128 + 


0.073 + 


0.1 87 + 


0.1 64 + 


0.1 30 + 


EWB 


0.335 


0.1 88 + 


0.343 


0.761 


0.201 + 


0.315 


FWB 


0.686 


0.579 


0.607 


0.460 


0.321 


0.524 



Notes: QLQ-C30 = EORTC-QLQ-C30 global score; PF = Physical functioning; RF 
= Role functioning; 

EF = Emotional functioning; CF = Cognitive functioning; SF = Social 
functioning. 

FACT-G = FACT-G global score; PWB = Physical well-being; SWB = Social well- 
being; 

EWB = Emotional well being; FWB = Functional well-being 

All correlations are significant at 0.05 level after Bonferroni corrections 

applied, except for + 

scores from the EQ-5D, SF-6D, and HUI2/3 are able to 
distinguish between cancer severity measures, namely 
the stage of cancer and ECOG-PS scores. The QLQ-C30 
and FACT-G appear to perform better than the generic 
preference-based measures, as indicated by higher effect 
size coefficients. The EQ-5D performed less favourably 
than the SF-6D and HUI2/3 in discriminating patients 
between the cancer severity measures used in this study. 
This result confirms what previous studies have found 
regarding the unresponsiveness nature of the EQ-5D 
when compared with other disease-specific instruments 
[3,29-32]; this may be a result of the instrument having 
only three levels to define each item and only five items. 
Notably, in the field of cancer many patients report hav- 
ing low energy and vitality. The EQ-5D does not include 
an item for energy or vitality. 

The comparison with the QLQ-C30 needs to be inter- 
preted with care as an overall summary score was not 
obtained for this instrument. Instead, the comparison was 
made using the two items asking patients to rate their 
overall health and overall QOL during the past week 
(items 29 and 30). It is possible that patients may not have 
considered all aspects that contribute to their QOL when 
providing a rating for these items; thereby resulting in an 

Table 5 Pearson correlations for the quality of life scores 
for all instruments 

QLQ-C30 FACT-G EQ-5D SF-6D HUI-2 HUI-3 

QLQ-C30 1 .00 



FACT-G 


0.59 


1 .00 










EQ-5D 


0.43 


0.50 


1.00 








SF-6D 


0.48 


0.47 


0.62 


1.00 






HUI-2 


0/1 1 


0.64 


0.48 


0.38 


1.00 




HUI-3 


0.44 


0.61 


0.68 


0.51 


0.70 


1.00 


EQ-VAS 


0.73 


0.51 


0.43 


0.45 


0.40 


0.44 



All correlations are significant at 0.05 level after Bonferroni corrections 
applied. 
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Table 6 Relationship between cancer severity variables and the QOL scores 








Mean score (SD) 






QLQ-C30 


FACT-G 


EQ-5D 


SF-6D 


HUI-2 


HUI-3 


Stage of cancer 














1 


73.89 (12.94) * 


87.71 (12.59) * 




n 7 a rn ns'i * 


0.84 (0.14) * 


0.81 (0.21) * 


2 


74.72 (16.74) * 


84.31 (13.54) * 


n Q/i rn 1 ^ 


u./ j i uj 


0.88 (0.10) * 


0.79 (0.23) * 


3 


70.64 (22.27) * 


82.63 (1 1.87) * 


0.85 (0.14) 


0.76 (0.1 1) * 


0.86 (0.1 1) * 


0.80 (0.22) * 


4 


65.02 (21.16) * 


78.98 (15.37) * 


0.82 (0.14) 


0.71 (0.11) * 


0.81 (0.15) * 


0.71 (0.23) * 


ECOG-PS 1 














0 


76.46 (17.93)* 


84.93 (13.97)* 


0.83 (0.21)* 


0.77 (0.11)* 


0.87 (0.15)* 


0.83 (0.19)* 


1 


67.81 (20.47)* 


81.11 (14.55)* 


0.78 (0.15)* 


0.72 (0.08)* 


0.80 (0.19)* 


0.75 (0.21)* 


2-3 


52.08 (15.97)* 


71.50 (12.13)* 


0.70 (0.18)* 


0.71 (0.10)* 


0.76 (0.14)* 


0.61 (0.24)* 


Self-reported health status 














Excellent - very good 


80.46 (17.41)* 


88.33 (9.89)* 


0.88 (0.14)* 


0.78 (0.09)* 


0.89 (0.09)* 


0.84 (0.20)* 


Good - fair 


67.79 (15.03)* 


81.78 (13.23)* 


0.83 (0.13)* 


0.72 (0.09)* 


0.84 (0.13)* 


0.77 (0.22)* 


Poor - very poor 


46.47 (19.74)* 


65.78 (13.04)* 


0.71 (0.11)* 


0.61 (0.06)* 


0.71 (0.14)* 


0.54 (0.22)* 



* Comparison of mean values (using ANOVA), P < 0.05. 
1 Eastern Cooperative Oncology Group performance status 

inaccurate estimate. Inter-domain correlations for the two 
cancer-specific instruments (e.g., between physical and 
emotional domains) were strong. However, the correlation 
between the social domains of the two cancer-specific 
instruments was weak. The weak correlation between 
these domains indicated that the scales tend to measure 
different aspects of social problems that cancer patients 
face. The FACT-G social domain is primarily concerned 
with aspects of social life whereas the social functioning 
scale of the QLQ-C30 is designed to address important 
limitations in family and social life caused by physical 
complaints [4,17,33]. Such a difference, as replicated by 
results of this study, indicated that these two QOL instru- 
ments are designed to measure different aspect of QOL 
and therefore may not interchangeable. 

The main advantage of using cancer-specific instru- 
ments is their items are more appropriate to the condition 

Table 7 Effect sizes of the cancer severity variables 



under investigation, unlike generic preference-based 
instruments, which incorporate broad domains covering 
all aspects of QOL. Furthermore, most items in the inves- 
tigated instruments, except those in the HUI2/3, incorpo- 
rate aspects of coping and adaptation. These items address 
the fact that patients may gradually learn to cope and 
adapt to their limitations in a number of ways such that, 
over time, the perception of the impact of their disease 
may be reduced. Previous studies have shown that cancer 
patients' emotional and functional well-being increase in 
the absence of corresponding increase in physical well- 
being, suggesting adaptation to physical limitations 
[34-36]. This process will have an impact on their overall 
QOL. 

In addition to the description of the items, the valua- 
tion methods and the psychometric properties of the 
generic instruments may provide another explanation 



Mean score (SD) 



QLQ-C30 



FACT-G 



EQ-5D 



SF-6D 



HUI-2 



HUI-3 



Stage of cancer 

1-2 74.44(15.43) 85.47 (13.18) 0.84(0.14) 0.73 (0.09) 0.87(0.12) 0.80(0.22) 

3-4 66.85 (21.61) 80.20 (14.36) 0.83(0.14) 0.72 (0.11) 0.82(0.14) 0.74(0.23) 

Effect size 0.49* 0.40* 0.06* 0.10* 0.36* 0.24* 

ECOG-PS score 1 

0 76.43 (17.79) 86.12 (11.24) 0.86 (0.16) 0.77 (0.11) 0.88(0.09) 0.83 (0.19) 

1-3 64.90 (20.70) 79.27 (14.83) 0.81(0.11) 0.71(0.10) 0.81(0.15) 0.72 (0.22) 

Effect size 0.65* 0.61* 0.31* 0.57* 0.50* 0.57* 

Self-reported health status 

Excellent-good 76.92 (16.06) 86.45 (10.93) 0.87 (0.13) 0.76 (0.09) 0.88(0.11) 0.83 (0.19) 

Fair-very poor 54.57 (17.83) 73.01(15.05) 0.75 (0.12) 0.66 (0.09) 0.75(0.14) 0.63 (0.23) 

Effect size 1.39 1.23 0.90 1.13 1.18 1.04 



* Comparison of mean values (using ANOVA), p < 0.05. 
1 Eastern Cooperative Oncology Group performance status 
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for the differences observed between the instruments. 
The SF-6D and the HUI2/3 use the SG technique for 
valuation, while the EQ-5D uses the TTO approach. 
The HUI2/3 uses multi-attribute utility (MAU) theory 
and multiplicative scoring models, while the other 
instruments use additive scoring methods. Although the 
scoring function for the HUI2/3 is derived from Cana- 
dian general public, the EQ-5D and SF-6D are based on 
a non-Canadian population. Furthermore, the scoring 
functions of the MAU preference-based instruments 
were derived from responses of the general public. As a 
result, this raises concerns as to whether the scoring 
functions of the EQ-5D and SF-6D best reflect the pre- 
ferences of Canadian cancer patients, especially consid- 
ering the fact that members of the general public do not 
often include aspects of adaptation into their valuations. 

The responsiveness of the instruments needs to be 
evaluated longitudinally; this was difficult to evaluate due 
to the cross-sectional nature of the study. If a treatment 
strategy results in a minimum clinically important differ- 
ence, the instruments will need to be able to detect this 
change. The most important question is whether these 
instruments are sensitive to changes in QOL; this can 
only be assessed in a longitudinal study. However, this 
study does investigate whether the QOL scores are corre- 
lated with cancer stage and ECOG-PS score. This is one 
of the strongest parts of this study given that the indivi- 
dual performance of the QOL instruments has been 
assessed previously. Results for the stage of cancer, how- 
ever should be interpreted with caution due to the small 
size of patients for stage one (N = 15) and the issue of 
adaptation. In this study, we do not have information on 
time since diagnosis. This may influence the patients to 
adapt to different health states. We believe, however, the 
self-reported measures of general health would capture 
some of the adaptation effect. 

The evaluative nature of these instruments also needs 
to be assessed, as it would be beneficial not only to mea- 
sure improvements in QOL with cancer treatments but 
also to compare these QOL scores with those obtained 
for other conditions over the longer term. There is also a 
need to examine the measurement properties of these 
instruments in patients with different cancer tumour 
sites and in different settings. While the patients in the 
study were attending an outpatient visit at the cancer 
centre, we did not have access to information as to the 
type of treatment they were receiving at the time of com- 
pleting the questionnaire. As such, assessing the differ- 
ences in QOL between, for example, chemotherapy and 
radiotherapy patients could not be examined. We recog- 
nize this as a limitation, and hope to gather this informa- 
tion in a subsequent study. 

As health is a function of both quality and length of 
life, the QALY is used to measure health outcomes in 



economic evaluation to compare the efficiency of differ- 
ent programs or treatment strategies in the health care 
system. For utilities to be of value, the scores obtained 
from these generic instruments need to be incorporated 
into a QALY measure of resource allocation decision- 
making. However, conducting a cost-utility analysis using 
the QOL values obtained in this study, only small 
changes will be observed when using generic instruments 
especially when comparing treatments for different can- 
cer stages. Combined with the poor sensitivity to detect 
subtle changes in QOL, these results indicate that generic 
preference-based instruments may not be appropriate for 
comparing cancer treatments. As such, a cancer-specific 
preference-based measure would need to be developed to 
overcome the limitations of using generic instruments. A 
measure such as this would ensure that the utilities used 
in economic evaluation better reflect the impact of the 
health condition under investigation [29,37-40]. This is 
achieved by developing an algorithm to map between the 
cancer-specific and generic preference-based instru- 
ments; results from such a study are beyond the scope of 
the current work and will be presented in a future paper. 

In conclusion, cancer-specific and generic preference- 
based instruments were demonstrated to be valid in discri- 
minating across levels of ECOG-PS scores and self- 
reported health status. However, the usefulness of the gen- 
eric instruments may be limited if they are not able to 
detect small changes in health status within cancer 
patients. This raises concerns regarding the appropriate- 
ness of these instruments when comparing different can- 
cer treatments within an economic evaluation framework. 
The results demonstrate that the SF-6D and HUI2/3 
appear to be better at discriminating patients between dif- 
ferent severities of disease than the EQ-5D. 

Researchers and practitioners should be mindful that 
some instruments may have greater 'sensitivity' to captur- 
ing QOL experiences in cancer patients. Administering 
both cancer-specific and generic preference-based mea- 
sures in clinical trials will still allow valuable information 
to be gained. The simultaneous use of both types of 
instruments would allow researchers to develop a statisti- 
cal algorithm to map between the cancer-specific and 
generic preference-based instruments; results from such 
a study will be presented in a future paper. Given the 
importance relevance of this research topic, further work 
is merited. 
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